Categorical understanding using statistical ngram models
نویسندگان
چکیده
In this paper, the speech understanding problem in the context of a spoken dialog system is formalized in a maximum likelihood framework. Word and dialog-state n-grams are used for building categorical understanding and dialog models, respectively. Acoustic con dence scores are incorporated in the understanding formulation. Problems due to data sparseness and out-of-vocabulary words are discussed. Incorporating dialog models reduces relative understanding error rate by 1525%, while acoustic con dence scores achieve a further 10% error reduction for a computer gam-
منابع مشابه
Phrase and Ngram-Based Statistical Machine Translation System Combination
Multiples translations can be computed by one machine translation (MT) system or by different MT systems. We may assume that different MT systems make different errors due to using different models, generation strategies, or tweaks. An investigated technique, inherited from automatic speech recognition (ASR), is the so-called system combination that is based on combining the outputs of multiple...
متن کاملModeling of Long Distance Context Dependency
Ngram models are simple in language modeling and have been successfully used in speech recognition and other tasks. However, they can only capture the short distance context dependency within an n-words window where currently the largest practical n for a natural language is three while much of the context dependency in a natural language occurs beyond a three words window. In order to incorpor...
متن کاملRecursive Path Models when Both Predictor and Response Variables are Categorical
Recursive path analysis is a useful tool for inference on a sequence of three or more response variables in which the causal effects of variables, if any, are in one direction. The primary objective in such analysis is to decompose the total effect of each variable into its direct and indirect components. Methods for recursive analysis of a chain of continuous variables are well developed but t...
متن کاملShould substance use disorders be considered as categorical or dimensional?
AIMS This paper discusses the representation of diagnostic criteria using categorical and dimensional statistical models. Conventional modeling using categorical or continuous latent variables in the form of latent class analysis and factor (IRT) analysis has limitations for the analysis of diagnostic criteria. METHODS New hybrid models are discussed which provide both categorical and dimensi...
متن کاملHead-Driven Parsing for Word Lattices
We present the first application of the head-driven statistical parsing model of Collins (1999) as a simultaneous language model and parser for largevocabulary speech recognition. The model is adapted to an online left to right chart-parser for word lattices, integrating acoustic, n-gram, and parser probabilities. The parser uses structural and lexical dependencies not considered by ngram model...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999